Adapt-Mix: learning local genetic correlation structure improves summary statistics-based analyses

نویسندگان

  • Danny S. Park
  • Brielin Brown
  • Celeste Eng
  • Scott Huntsman
  • Donglei Hu
  • Dara G. Torgerson
  • Esteban Gonzàlez Burchard
  • Noah Zaitlen
چکیده

MOTIVATION Approaches to identifying new risk loci, training risk prediction models, imputing untyped variants and fine-mapping causal variants from summary statistics of genome-wide association studies are playing an increasingly important role in the human genetics community. Current summary statistics-based methods rely on global 'best guess' reference panels to model the genetic correlation structure of the dataset being studied. This approach, especially in admixed populations, has the potential to produce misleading results, ignores variation in local structure and is not feasible when appropriate reference panels are missing or small. Here, we develop a method, Adapt-Mix, that combines information across all available reference panels to produce estimates of local genetic correlation structure for summary statistics-based methods in arbitrary populations. RESULTS We applied Adapt-Mix to estimate the genetic correlation structure of both admixed and non-admixed individuals using simulated and real data. We evaluated our method by measuring the performance of two summary statistics-based methods: imputation and joint-testing. When using our method as opposed to the current standard of 'best guess' reference panels, we observed a 28% decrease in mean-squared error for imputation and a 73.7% decrease in mean-squared error for joint-testing. AVAILABILITY AND IMPLEMENTATION Our method is publicly available in a software package called ADAPT-Mix available at https://github.com/dpark27/adapt_mix.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Impact of Summary Writing with Structure Guidelines on EFL College Students’ Rhetorical Organization: Integrating Genre-Based and Process Approaches

This study aimed at investigating the impact of writing on Iranian EFL college students’ rhetorical organization. Thirty Iranian female undergraduate students majoring in English at Al-zahra University participated in the current study. The writing instructions included two stages, each lasting for four weeks. The participants were assigned to a control group and an experimental group according...

متن کامل

DISSCO: direct imputation of summary statistics allowing covariates

BACKGROUND Imputation of individual level genotypes at untyped markers using an external reference panel of genotyped or sequenced individuals has become standard practice in genetic association studies. Direct imputation of summary statistics can also be valuable, for example in meta-analyses where individual level genotype data are not available. Two methods (DIST and ImpG-Summary/LD), that a...

متن کامل

Investigation of Genetic Diversity and Structure Analysis of Different Citrus Genotypes Using ISSR Markers

In breeding programs, it is necessary having knowledge of the relatedness and genetic diversity in germplasm pools. The spread of cultivated regions and the high levels of production indicates citrus importance in the global economy. Therefore, 110 citrus genotypes were evaluated using 12 ISSR markers. Overall, 154 polymorphic bands were scored with an average of 12.8 alleles per primer. The po...

متن کامل

Genetic Differentiation of Draa Indigenous Breed and Relationships with Other Goat Populations Assessed by Microsatellite DNA Markers

Moroccan goats are characterized by the presence of different populations identified only based on their phenotypes. The objectives of this study were to assess the genetic differentiation of the Draa goat breed and to analyze its genetic structure and its relationships with other local populations using 12 microsatellite markers. The screening was done in South Eastern and Southern Morocco on ...

متن کامل

GWIS: Genome-Wide Inferred Statistics for Functions of Multiple Phenotypes.

Here we present a method of genome-wide inferred study (GWIS) that provides an approximation of genome-wide association study (GWAS) summary statistics for a variable that is a function of phenotypes for which GWAS summary statistics, phenotypic means, and covariances are available. A GWIS can be performed regardless of sample overlap between the GWAS of the phenotypes on which the function dep...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 31  شماره 

صفحات  -

تاریخ انتشار 2015